Automatic Headline Generation for Newspaper Stories

Author

  • David Zajic
Abstract

In this paper we propose a novel application of Hidden Markov Models to automatic generation of informative headlines for English texts. We propose four decoding parameters to make the headlines appear more like Headlinese, the language of informative newspaper headlines. We also allow for morphological variation in words between headline and story English. Informal and formal evaluations indicate that our approach produces informative headlines, mimicking a Headlinese style generated by humans.
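
The abstract does not give the model's structure or name its four decoding parameters, so the following is only a rough sketch of the general idea of picking headline words from a story by dynamic programming. It is a simplified Viterbi-style search over keep/skip decisions, scored with a bigram headline language model, and should not be read as the paper's actual HMM. The function name `generate_headline`, the `bigram_logp` table, and the `skip_logp` and `unk_logp` values are all hypothetical and chosen for illustration.

```python
import math

START = "<s>"  # hypothetical start-of-headline symbol


def generate_headline(story, bigram_logp, skip_logp=math.log(0.5), unk_logp=-10.0):
    """Return the best-scoring in-order subsequence of `story`.

    Score of a candidate headline = sum of bigram log-probabilities of the
    kept words + `skip_logp` for every story word that is skipped.
    All parameters are illustrative assumptions, not values from the paper.
    """
    n = len(story)
    # best[j] = (score, backpointer) of the best path whose last kept word is story[j]
    best = [None] * n
    for j in range(n):
        # Case 1: story[j] is the first kept word, so j earlier words are skipped.
        score = j * skip_logp + bigram_logp.get((START, story[j]), unk_logp)
        back = -1
        # Case 2: story[i] was the previous kept word; words between i and j are skipped.
        for i in range(j):
            gap = j - i - 1
            cand = (best[i][0] + gap * skip_logp
                    + bigram_logp.get((story[i], story[j]), unk_logp))
            if cand > score:
                score, back = cand, i
        best[j] = (score, back)

    # Close each path by skipping the remaining words; the empty headline is the baseline.
    end_j, end_score = -1, n * skip_logp
    for j in range(n):
        total = best[j][0] + (n - 1 - j) * skip_logp
        if total > end_score:
            end_j, end_score = j, total

    # Follow backpointers to recover the kept words in story order.
    words = []
    j = end_j
    while j != -1:
        words.append(story[j])
        j = best[j][1]
    return words[::-1]


# Toy example with made-up bigram scores.
story = ["the", "mayor", "said", "taxes", "will", "rise"]
toy_bigrams = {
    (START, "mayor"): -0.5,
    ("mayor", "taxes"): -0.5,
    ("taxes", "rise"): -0.5,
}
print(" ".join(generate_headline(story, toy_bigrams)))
```

Because words are only ever selected in story order, the output is always a subsequence of the input. A real system would estimate the bigram scores from a corpus of headlines and, as the abstract notes, would also need to handle morphological variation between headline and story words, which this sketch ignores.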


Similar resources

Hedge Trimmer: A Parse-And-Trim Approach To Headline Generation

This paper presents Hedge Trimmer, a HEaDline GEneration system that creates a headline for a newspaper story using linguistically-motivated heuristics to guide the choice of a potential headline. We present feasibility tests used to establish the validity of an approach that constructs a headline by selecting words in order from a story. In addition, we describe experimental results that demon...


On newspaper headlines as relevance optimizers

This paper suggests an explanatory functional characterization of newspaper headlines. Couched within Sperber and Wilson’s (1986) relevance theory, the paper makes the claim that headlines are designed to optimize the relevance of their stories for their readers: Headlines provide the readers with the optimal ratio between contextual effect and processing effort, and direct readers to construct...


Headliner: An integrated headline suggestion system

Headline generation is a short-form variant of document summarization that has been studied in natural language processing. This paper presents a case study examining the application of several different headline generation models at The Washington Post. Currently for individual news articles, multiple different headlines are manually written in order to target different platforms such as the w...


Topic Tracking Based on Linguistic Features

This paper explores two linguistically motivated restrictions on the set of words used for topic tracking on newspaper articles: named entities and headline words. We assume that named entities are one of the linguistic features for topic tracking, since both topic and event are related to a specific place and time in a story. The basic idea behind using headline words for the tracking task is that he...



Publication date: 2002